PyDigger - unearthing stuff about Python


Name Version Summary Date
similaripy 0.2.5 High-performance KNN similarity functions in Python, optimized for sparse matrices 2025-10-25 15:35:39
leksara 0.2.2 Indonesian-language text processing library for the e-commerce domain (cleaning, PII masking, review mining, pipeline). 2025-10-25 05:18:43
rindle 0.1.0 Dataset preparation library with Python bindings for sliding window tensors 2025-10-22 19:12:25
unicode-sanity 0.1.0 Clean invisible/control Unicode characters, normalize text, and optionally convert/remove emoji — with an explainable report. 2025-10-22 16:46:12
opler 0.0.1 A local, offline-first entity and place resolution system that maps messy place/entity strings and codes to canonical entities with calibrated confidence scores 2025-10-22 14:24:23
audiotame 0.1.16 Command-line tool that normalizes audio and reduces noise. 2025-10-19 12:40:03
charset-normalizer 3.4.4 The Real First Universal Charset Detector. Open, modern and actively maintained alternative to Chardet. 2025-10-14 04:42:32
reg-normalizer 1.0.8 Tool for normalizing and standardizing Russian region names 2025-10-13 15:31:10
intelli3text 0.2.5 Ingestion (web/PDF/DOCX/TXT), cleaning, paragraph-level LID (PT/EN/ES), and spaCy-based normalization; PDF export. 2025-10-13 00:46:31
textprettify 1.0.0 A comprehensive Python library for text formatting, transformation, and analysis 2025-10-08 11:05:34
cmai 0.1.6 AI Powered Commit Message Normalization Tool 2025-09-08 15:19:23
oxidize-postal 0.1.2 High-performance postal address parser and normalizer using libpostal with Rust bindings 2025-09-06 22:14:52
email-typo-fixer 1.1.2 A Python library to automatically detect and fix common typos in email addresses 2025-09-04 20:37:27
inoutlists 1.0.3 inoutlists is a python package to parse and normalize different sources of lists (OFAC, EU, UN, etc) to a common dictionary interface. 2025-08-25 20:37:13
tidyname 0.1.0 Intelligent company name cleaning for Python 2025-08-22 00:52:05
tk-normalizer 1.0.1 URL normalization library for consistent URL representation 2025-08-20 16:50:57
yosina 0.1.0 Japanese text transliteration library 2025-08-19 18:28:38
joyokanji 1.1.0 joyokanji converts old-form kanji characters into new-form kanji characters. 2025-08-13 13:48:30
easy-unet 0.1.0 A lightweight and flexible PyTorch library providing a modular UNet implementation with advanced attention and normalization blocks for fast image processing and deep learning development. 2025-08-11 09:47:38
recollapse 1.0.0 REcollapse is a helper tool for black-box regex fuzzing to bypass validations and discover normalizations in web applications 2025-08-07 19:22:31